Multiple instrument tracking based on reconstruction error, pitch continuity and instrument activity
نویسندگان
چکیده
We present an algorithm for tracking individual instruments in polyphonic music recordings. The algorithm takes as input the instrument identities of the recording and uses non-negative matrix factorisation to compute an instrument-independent pitch activation function. The Viterbi algorithm is applied to find the most likely path through a number of candidate instrument and pitch combinations in each time frame. The transition probability of the Viterbi algorithm includes three different criteria: the frame-wise reconstruction error of the instrument combination, a pitch continuity measure that favours similar pitches in consecutive frames, and the activity status of each instrument. The method was evaluated on mixtures of 2 to 5 instruments and outperformed other state-of-the-art multi-instrument tracking methods.
منابع مشابه
A User-assisted Approach to Multiple Instrument Music Transcription
The task of automatic music transcription has been studied for several decades and is regarded as an enabling technology for a multitude of applications such as music retrieval and discovery, intelligent music processing and large-scale musicological analyses. It refers to the process of identifying the musical content of a performance and representing it in a symbolic format. Despite its long ...
متن کاملPolyphonic Note and Instrument Tracking Using Linear Dynamical Systems
In this paper, a system for automatic transcription of multiple-instrument polyphonic music is proposed, which supports tracking multiple concurrent notes using linear dynamical systems (LDS). The system is based on a spectrogram factorisation model which extends probabilistic latent component analysis (PLCA), and supports the detection of multiple pitches, instrument contributions, and pitch d...
متن کاملA Shift-Invariant Latent Variable Model for Automatic Music Transcription
In this work, a probabilistic model for multiple-instrument automatic music transcription is proposed. The model extends the shift-invariant probabilistic latent component analysis method, which is used for spectrogram factorization. Proposed extensions support the use of multiple spectral templates per pitch and per instrument source, as well as a time-varying pitch contribution for each sourc...
متن کاملExplicit Duration Hidden Markov Models for Multiple-Instrument Polyphonic Music Transcription
In this paper, a method for multiple-instrument automatic music transcription is proposed that models the temporal evolution and duration of tones. The proposed model supports the use of spectral templates per pitch and instrument which correspond to sound states such as attack, sustain, and decay. Pitch-wise explicit duration hidden Markov models (EDHMMs) are integrated into a convolutive prob...
متن کاملSecond Fiddle is Important Too: Pitch Tracking Individual Voices in Polyphonic Music
Recently, there has been much interest in automatic pitch estimation and note tracking of polyphonic music. To date, however, most techniques produce a representation where pitch estimates are not associated with any particular instrument or voice. Therefore, the actual tracks for each instrument are not readily accessible. Access to individual tracks is needed for more complete music transcrip...
متن کامل